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Abstract 

We provide a physical interpretation of the Kirchhoff index of any molecules as well as of the 
Wiener index of acyclic ones. For the purpose, we use a local vertex invariant that is obtained 
from first principles and describes the atomic displacements due to small vibrations/oscillations of 
atoms from their equilibrium positions. In addition, we show that the topological atomic 
displacements correlate with the temperature factors (B -factors) of atoms obtained by X-ray 
crystallography for both organic molecules and biological macromolecules. 
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1. Introduction 

Many topological ideas have been introduced in chemistry in an ad hoc way [1]. A classical 
example is provided by the oldest topological index, which is nowadays known as the Wiener 
index, W [2]. It is defined as the sum of all shortest-path distances between (non-hydrogen) 
atoms in a molecule. This index correlates very well with many physico-chemical properties of 
organic molecules [3]. Several attempts to provide a physico-chemical interpretation of W have 
been conducted. In one of them, W has been shown to represent a rough measure of the 
molecular surface area [4]. More recently, Gutman and Zenkevich [5] have shown that this index 
is related to the internal energy of organic molecules, with a special role played by the vibrational 
energy. 

In general, very few approaches to defining topological invariants in chemistry start from 
first-principle physical concepts, deriving indices which are physically sound and chemically 
useful. An attempt to define a topological index along the line of this strategy was developed by 
Klein and Randic, who defined the so-called Kirchhoff index, Kf [6]. The Kirchhoff index is 
defined in an analogous way to the Wiener index but by using the concept of resistance distance 
r^j between pairs of nodes instead of the shortest-path distance. Despite that the index uses well- 
known concepts from physics such as Ohm's and Kirchhoff s laws [6], it is not straightforward to 
realise what the "electrical resistance" means for a chemical bond. These difficulties have urged 
us to search for a first-principle approach to defining topological invariants with a clear physico- 
chemical interpretation and that solve existing chemico-structural problems. 

Here we derive a local vertex invariant from first principles which describes the atomic 
displacements due to small vibrations/oscillations of atoms from their equilibrium positions. 
Using this approach we provide a clear and unambiguous physical interpretation of the Kirchhoff 
index of any molecule in terms of atomic displacements. We show here that the Kirchhoff index is 
the sum of the squared atomic displacements produced by small molecular vibrations or 
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oscillations of atoms from their equilibrium positions. For acyclic molecules as the ones studied 
by Gutman and Zenkevich [5], our results explain the relationship between the Wiener index and 
vibrational molecular energy. The topological atomic displacements are shown here to correlate 
with the temperature factors (B-factors) of atoms obtained by X-ray crystallography. We illustrate 
our results for both organic molecules and proteins. 

2. Background 

Here we represent molecules as graphs G = (v.e), where nodes represent united atoms and edges 
represent physical interactions between such united atoms. In the simplest case of an alkane 
molecule the nodes represent the united atoms CH^ , where n = 0,1,2,3 , and the edges are the 

covalent C-C bonds; in other words, the graph corresponds to the hydrogen-depleted molecular 
graph. However, we are not constrained here to such representation. For instance, a protein can be 
represented through its residue interaction graph/network [7]. In this approach the nodes are 
united-atom representations of the amino acids, centred at their atoms, with the exception of 

glycine for which is used. Two nodes are then connected if the distance r.j between both 
atoms of the residues i and j is not longer than a certain cutoff value . The elements of the 
adjacency matrix of the residue network are obtained by 




where h{x>0)=1 and h{x <0) = 0, We use =7.0 A [7] hereafter. 



The Wiener index W is defined as [2] 




(1) 
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where d^j is the shortest-path distance between atoms / and j in the molecular graph. In the case 

of a molecular network like a residue network the Wiener index divided by the number of nodes 
has been used as a criterion for defining 'small-world' networks. 

The Kirchhoff index is defined as [6] 



where the resistance distance r^j between nodes / and j in a graph is obtained through the 
Moore-Penrose generalised inverse of the Laplacian matrix [8], as 



The Laplacian matrix is defined as L = D-A, where D is the diagonal matrix of node degrees 
and A is the adjacency matrix. It is well-known that for acyclic molecules the Wiener and the 
Kirchhoff indices coincide. 



We now consider the classical analogy in which the atoms are represented by balls and bonds are 
identified with springs with a common spring constant k [9]. We would like to consider a 
vibrational excitation energy from the static position of the molecule. Let x. denote the 

displacement of an atom / from its static position. Then the vibrational potential energy of the 
molecule can be expressed as 




(2) 



(Li,+(r),-2(r), 



(3) 



3. 



Topological Atomic Displacements 



v{x) = — x^Lx , 



(4) 



where x is the vector whose / th entry x, is the displacement of the atom / . 
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Now we are going to suppose that the molecule is immersed into a thermal bath of inverse 

temperature p , where is the Boltzmann constant. Then the probability distribution of 

kj 

the displacement of the nodes is given by the Boltzmann distribution 



P\x) = = — exp X Lx 

^ ^ Z Z \ 2 

where the normalization factor Z is the partition function of the molecule 



(5) 



Z = |(ixexp|^-^x^Lxj. (6) 
The mean displacement of an atom / can be expressed by 

(7) 

We can calculate this quantity once we can diagonalise the Laplacian matrix L . Let us 
denote by U the matrix whose columns are the orthonormal eigenvectors y/^ and A the diagonal 

matrix of eigenvalues of the Laplacian matrix. Note here that the eigenvalues of the Laplacian 

of a molecular graph are positive except for one zero eigenvalue. Then, we write the Laplacian 
spectrum as ^ = < ^2 ^ • • • ^ An important observation here is that the zero eigenvalue does 

not contribute to the vibrational energy. This is because the mode // = 1 is the mode where all the 
atoms (balls) move coherently in the same direction and thereby the whole molecule moves in one 
direction. In other words, this is the motion of the centre of mass, not a vibration. 
In calculating Eqs. (5) and (6), the integration measure is transformed as 



dx = Y\dx. = Ideti/IP^^/y. = dy (8) 

i=\ i=\ 

because the determinant of the orthogonal matrix, dett/, is either ±1. Then we have 
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(9) 



Note again that because /l^ = the contribution from this eigenvalue obviously diverges. This is 
because nothing stops the whole molecule from moving coherently in one direction. When we are 
interested in the vibrational excitation energy within the network, we should offset the motion of 
the centre of mass and focus on the relative motion of the nodes. We therefore redefine the 
partition function by removing the first component ju = l from the last product. We thereby have 



ju=2 



V ^ J 



(10) 



Next we calculate the mean displacement Ax. defined by Eq. (7). We first compute the numerator 
of the right-hand side of Eq. (7) as follows: 



V 2 



/; = J dSxf exp 
= \dy{lJy^^QM-^f Ay 



2 

exp 



^ n n 



(11) 



On the right-hand side, any terms with v^y will vanish after integration because the integrand is 
an odd function with respect to and . The only possibility of a finite result is due to terms 



with v = Y , We therefore have 
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,v=l 



TT ( Pk , , 



(12) 



v=2 



//=1 

JU^V 



where we separated the contribution from the zero eigenvalue and those from the other ones. Due 
to the divergence introduced by the zero eigenvalue we proceed the calculation by redefining the 
quantity /. with the zero mode removed: 



v=2 



M=2 



2 ' ' 



8/1 1 2;r 



(13) 



We therefore arrive at the following expression for the mean displacement of an atom: 



(14) 



If we designate by the Moore-Penrose generalised inverse of the graph Laplacian [8], which 
has been proved to exist for any molecular graph, then it is straightforward to realise that 



(15) 



4. Kirchhoff and Wiener indices revisited 



From now on we are going to consider the case pk=\ for the sake of simplicity. Then, it is easy 



to see that due to the orthonormality of the eigenvectors of the inverse Laplacian, we have 
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t(A^,f=tr(v)=±^=^Kf(G), (16) 

That is, the Kirchhoff index of a molecular graph is simply the sum of the squared atomic 
displacements produced by small molecular vibrations multiplied by the number of atoms in the 
molecule. Since it is well-known that for acyclic molecules, i.e., molecular trees, the Kirchhoff 
and Wiener indices coincide, we also have 

w(T)=nf^{Ax,f . (17) 
Then the potential energy (4) can be expressed as 

n 

Let i?. = be the sum of all resistance distances from atom / to any atom in the molecule, i.e., 

7=1 

the sum of the / th row (or column) of the resistance distance matrix. That is, 
Ri = Z + - )• It is known that Y.Lt=0, Then, 

7=1 7=1 

R,=nLl, +tr(r)=n{Ax,y , (18) 



i=l 



This relation indicates that {Ax-f and R. are linearly related for the atoms of a given molecule. 

Using Eq. (18) we can express the potential energy (4) in terms of the resistance distance of the 
atoms in the molecular graph 

2n^ n n ^ J 2n 

where (k) is the average degree of the molecular graph. The first term in the right-hand side of 
Eq. (19) was already introduced by Estrada et al. [10] as a topological index obtained from the 



quadratic form ^v|D|u^, where v is a vector of node degrees, D is the distance matrix and u is 
an all-one vector. 

In summary, the normalised Kirchhoff index of a molecular graph represents the sum of 
squared displacements of atoms due to molecular vibrations and the sum of resistance distances 
for a given atom depends linearly on the square of the displacement of the corresponding atom. 

The term {Ax.f has a very clear physical interpretation. It represents the atomic displacement due 

to molecular vibrations. Small values of (Ax.)^ indicate that those atoms are very rigid in the 
molecule. For instance, in 2,2,3 -trimethylbutane the smallest displacement is obtained for the 
carbon atom connected to three methyl groups Ax^ = 0.534 , followed by the one bounded to two 

CH3 groups, Ax^^ = 0.655 . Then, the methyl groups display the largest displacements, 
AXf^fj^ =1.000 for those at position 2 and Axch, =1-069 for those at position 3. 
5. Topological Displacements and Temperature Factors 

We guess that the atomic displacement Ax. should display some linear correlation with an 

experimental measure of how much an atom oscillates or vibrates around its equilibrium position. 
Such experimental measure is provided by X-ray experiments as the so-called B-factor or 
temperature factor, and represents the reduction of coherent scattering of X-rays due to the 
thermal motion of the atoms. For instance, in the molecule of naphthalene the atomic 
displacements of carbon atoms correlate very well with the experimental B-factors (in 
parenthesis) [11]: 0.898 (4.6 A^), 0.815 (4.0 A^) and 0.615 (3.4 A^), which gives a correlation 
coefficient r=Q96. The following correlation coefficients are obtained for: anthracene [12] ( 
r^99); phenanthrene [13] (r=Q95); pyrene [14] (r=099); and triphenylene [15] ( 
r=Q99). In these cases we averaged the values of B-factors for equivalent carbon atoms. In 
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Fig. 1 we plot the values of Ax. versus the B-factors for the carbon atoms of anthracene and 
pyrene. 

Insert Fig. 1 about here. 

The B-factors are quite relevant for the study of protein structures as they contain valuable 
information about the dynamical behaviour of proteins and several methods have been designed 
for their prediction [16]. It is known that regions with large B-factors are usually more flexible 
and functionally important. The atomic displacements have been used previously by Bahar et al. 
[17] to describe thermal fluctuations in proteins. We note in passing that we use here a residue 
network representation of the protein based on p -carbons instead of the a -carbons used by 
Bahar et al. 

For the sake of illustration we have selected here the lipase b from Candida antarctica 
(Itca) [18]. In this case we obtain a correlation coefficient r = 0.7 4 between the experimental B- 
factors and the topological atomic displacements. For this protein Yuan et al. [19] reported 
r = 0.6 3 for predicting the experimental B-factors. In Fig. 2 (top) we illustrate the profiles for the 
normalised B-factors and the topological atomic displacements of residues for this protein. We 
also represent in Fig. 2 (bottom) the 20 residues with the highest values of Ax. in the molecular 

structure of the protein. We recall that the residues with the largest values of the atomic 
displacements are those displaying the highest flexibility in the protein. Here we have represented 
these residues by using blue colour for the atoms in these residues. We also represent the 20 
residues with the lowest values of Ax. , which correspond to those displaying the highest rigidity 

in the protein. They are coloured in red in the molecular structure of the protein. As can be seen 
the most flexible amino acids are those which are on the surface of the protein, while the most 
rigid ones are concentrated around the protein core. 

Insert Fig. 2 about here. 
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The new relationship obtained here between the topological atomic displacements and the 
sum of the resistance distances for a given atom, i.e., the expression (17), opens up new 
possibilities for interpreting Ax. in a given molecule. According to Eq. (17) the topological 
displacements for the atoms in a molecule depend only on the sum of the resistance distances for 

the corresponding atom, e.g., (Ax.)^ ~ — T!^/ [6]. It is known that if there is more than one path 

connecting two atoms in a molecule, i.e., there are cycles, the resistance distance is smaller than in 
the case when there is only a single path. Then, if there is one oscillation/vibration in one atom 
which is transmitted to all the other atoms through the different paths connecting them, the 
vibration is attenuated along every path. Consequently, a small value of Ax. is due to the fact that 

the atom / is part of a large number of paths connecting it to other atoms. This implies that when 
the other atoms oscillate/vibrate their effect is very much attenuated before arriving to / . 
6. Conclusions 

We have developed a theoretical approach based on classical molecular mechanics to 
accounting for small displacements of atoms from their equilibrium positions due to oscillations 
or vibrations. The topological atomic displacements are expressed in terms of the eigenvalues and 
eigenvectors of the discrete Laplacian matrix of the molecular graph. Using this approach we have 
given a clear and unambiguous physical interpretation of the Kirchhoff index as well as of the 
Wiener index of acyclic molecules. It explains previous empirical results clearly, showing that 
these indices are related to vibrational energy of alkanes and dithioderivative compounds. More 
importantly, the topological atomic displacements are well correlated with the B-factors obtained 
by X-ray crystallography. 
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Figure Captions 

Fig. 1. Linear correlation between the topological atomic displacements and experimental B- 
factors for carbon atoms of anthracene (empty circles) and pyrene (empty squares). The 
temperature factors of equivalent atoms were averaged. 

Fig. 2. Profiles of the topological atomic displacements (solid line) and the B-factors (dotted line) 
for lipase b from Candida antarctica, Itca (top), and illustration of the amino acids having the 20 
largest (blue) and 20 smallest (red) values of the topological atomic displacements (bottom). 
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Fig.l 
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Fig. 2 




